F2Net: Learning to Focus on the Foreground for Unsupervised Video Object Segmentation
نویسندگان
چکیده
Although deep learning based methods have achieved great progress in unsupervised video object segmentation, difficult scenarios (e.g., visual similarity, occlusions, and appearance changing) are still no well-handled. To alleviate these issues, we propose a novel Focus on Foreground Network (F2Net), which delves into the intra-inter frame details for foreground objects thus effectively improve segmentation performance. Specifically, our proposed network consists of three main parts: Siamese Encoder Module, Center Guiding Appearance Diffusion Dynamic Information Fusion Module. Firstly, take siamese encoder to extract feature representations paired frames (reference current frame). Then, Module is designed capture inter-frame (dense correspondences between reference frame), intra-frame original semantic frame. Different from Anchor Network, establish Prediction Branch predict center location leverage point information as spatial guidance prior enhance extraction, representation considerably focus objects. Finally, automatically select relatively important features through aforementioned different level features. Extensive experiments DAVIS, Youtube-object, FBMS datasets show that F2Net achieves state-of-the-art performance with significant improvement.
منابع مشابه
Unsupervised NN approach and PCA for Background – Foreground video segmentation
MPEG-4 based video coding applications require the segmentation of each video image in its principal moving objects to be coded independently from each other. Several techniques of video objects segmentation for coding purposes have been presented in literature; all such segmentation techniques are based on the smart soft-thresholding of the motion fields, the best ones dealing with dense motio...
متن کاملInstance Embedding Transfer to Unsupervised Video Object Segmentation
We propose a method for unsupervised video object segmentation by transferring the knowledge encapsulated in image-based instance embedding networks. The instance embedding network produces an embedding vector for each pixel that enables identifying all pixels belonging to the same object. Though trained on static images, the instance embeddings are stable over consecutive video frames, which a...
متن کاملa novel unsupervised approach for minimally-invasive video segmentation
background: laparoscopy or minimally invasive surgery is a surgical procedure in which laparoscope and other surgical instruments are inserted inside body via a few small incisions. laparoscope is used to look inside the patient's body and records displayed images. temporal segmentation of laparoscopic videos has many applications like detecting laparoscopic anomalies and interrupts. it is prer...
متن کاملA Neural Network based Scheme for Unsupervised Video Object Segmentation
In this paper, we proposed a neural network based scheme for performing unsupervised video object segmentation, especially for videophone or videoconferencing applications. The procedure includes (a) a training algorithm for adapting the network weights to the current condition, (b) a Maximum A Posteriori (MAP) estimation procedure for optimally selecting the most representative data of the cur...
متن کاملSmooth Foreground-Background Segmentation for Video Processing
We propose an efficient way to account for spatial smoothness in foreground-background segmentation of video sequences. Most statistical background modeling techniques regard the pixels in an image as independent and disregard the fundamental concept of smoothness. In contrast, we model smoothness of the foreground and background with a Markov random field, in such a way that it can be globally...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2021
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v35i3.16308